NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure

Yuan, Huizhuo; Li, Chris Junchi; Gidel, Gauthier; Jordan, Michael I; Gu, Quanquan; Du, Simon S (December 2023, Advances in neural information processing systems)

Full Text Available
Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization

Li, Chris Junchi; Yuan, Angela; Gidel, Gauthier; Gu, Quanquan; Jordan, Michael (January 2023, International Conference on Machine Learning (ICML))

Full Text Available
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

Chen, Zixiang Chen; Li, Chris Junchi; Yuan, Angela; Gu, Quanquan; Jordan, Michael I. (January 2023, International Conference on Learning Representations (ICLR))

Full Text Available
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

Li, Chris Junchi; Zhou, Dongruo; Gu, Quanquan; Jordan, Michael I. (January 2022, Advances in neural information processing systems)

Full Text Available
On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration

Mou, Wenlong; Li, Chris Junchi; Wainwright, Martin J; Bartlett, Peter L; Jordan, Michael I (January 2020, Proceedings of Thirty Third Conference on Learning Theory)
null (Ed.)
We undertake a precise study of the asymptotic and non-asymptotic properties of stochastic approximation procedures with Polyak-Ruppert averaging for solving a linear system $$\bar{A} \theta = \bar{b}$$. When the matrix $$\bar{A}$$ is Hurwitz, we prove a central limit theorem (CLT) for the averaged iterates with fixed step size and number of iterations going to infinity. The CLT characterizes the exact asymptotic covariance matrix, which is the sum of the classical Polyak-Ruppert covariance and a correction term that scales with the step size. Under assumptions on the tail of the noise distribution, we prove a non-asymptotic concentration inequality whose main term matches the covariance in CLT in any direction, up to universal constants. When the matrix $$\bar{A}$$ is not Hurwitz but only has non-negative real parts in its eigenvalues, we prove that the averaged LSA procedure actually achieves an $O(1/T)$ rate in mean-squared error. Our results provide a more refined understanding of linear stochastic approximation in both the asymptotic and non-asymptotic settings. We also show various applications of the main results, including the study of momentum-based stochastic gradient methods as well as temporal difference algorithms in reinforcement learning.
more » « less
Full Text Available
On the diffusion approximation of nonconvex stochastic gradient descent

https://doi.org/10.4310/amsa.2019.v4.n1.a1

Hu, Wenqing; Li, Chris Junchi; Li, Lei; Liu, Jian-Guo (January 2019, Annals of Mathematical Sciences and Applications)
null (Ed.)
Full Text Available

Search for: All records